Search CORE

21 research outputs found

Using registries to integrate bioinformatics tools and services into workbench environments

Author: Ison Jon
Kalaš Matúš
Ménager Hervé
Rapacki Kristoffer
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

The diversity and complexity of bioinformatics resources presents significant challenges to their localisation, deployment and use, creating a need for reliable systems that address these issues. Meanwhile, users demand increasingly usable and integrated ways to access and analyse data, especially within convenient, integrated “workbench” environments. Resource descriptions are the core element of registry and workbench systems, which are used to both help the user find and comprehend available software tools, data resources, and Web Services, and to localise, execute and combine them. The descriptions are, however, hard and expensive to create and maintain, because they are volatile and require an exhaustive knowledge of the described resource, its applicability to biological research, and the data model and syntax used to describe it. We present here the Workbench Integration Enabler, a software component that will ease the integration of bioinformatics resources in a workbench environment, using their description provided by the existing ELIXIR Tools and Data Services Registry

University of Bergen

Springer - Publisher Connector

NORA - Norwegian Open Research Archives

Online Research Database In Technology

FurIOS: a web-based tool for identification of Vibrionaceae species using the fur gene

Author: Cardoso Joao
Giubergia Sonia
Gram Lone
Machado Henrique
Rapacki Kristoffer
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2017
Field of study

Gene based methods for identification of species from the Vibrionaceae family have been developed during the last decades to address the limitations of the commonly used 16S rRNA gene phylogeny. Recently, we found that the ferric-uptake regulator gene (fur) can be used as a single identification marker providing species discrimination, consistent with multi-locus sequencing analyses and whole genome phylogenies. To allow for broader and easy use of this marker, we have developed an online prediction service that allows the identification of Vibrionaceae species based on their fur-sequence. The input is a DNA sequence that can be uploaded on the web service; the output is a table containing the strain identifier, e-value, and percentage of identity for each of the matches with rows colored in green for hits with high probability of being the same species. The service is available on the web at: http://www.cbs.dtu.dk/services/furIOS-1.0/. The fur-sequences can be derived either from genome sequences or from PCR-amplification of the genomic region encoding the fur gene. We have used 191 strains identified as Vibrionaceae based on 16S rRNA gene sequence to test the PCR method and the web service on a large dataset. We were able to classify 171 of 191 strains at the species level and 20 strains remained unclassified. Furthermore, the fur phylogenetics and subsequent in silico DNA-DNA hybridization demonstrated that two strains (ATCC 33789 and ZS-139) previously identified as Vibrio splendidus are more closely related to V. tasmaniensis and V. cyclitrophicus, respectively. FurIOS is an easy-to-use online service that allows the identification of bacteria from the Vibrionaceae family at the species level using the fur gene as a single marker. Its simplistic design and straightforward pipeline makes it suitable for any research environment, from academia to industry

Frontiers - Publisher Connector

PubMed Central

Online Research Database In Technology

Tools and data services registry: a community effort to document bioinformatics resources

Author: et al. 69 authors
Ison Jon
Kalaš Matúš
Ménager Hervé
Rapacki Kristoffer
Publication venue: 'Oxford University Press (OUP)'
Publication date: 24/08/2016
Field of study

Life sciences are yielding huge data sets that underpin scientific discoveries fundamental to improvement in human health, agriculture and the environment. In support of these discoveries, a plethora of databases and tools are deployed, in technically complex and diverse implementations, across a spectrum of scientific disciplines. The corpus of documentation of these resources is fragmented across the Web, with much redundancy, and has lacked a common standard of information. The outcome is that scientists must often struggle to find, understand, compare and use the best resources for the task at hand. Here we present a community-driven curation effort, supported by ELIXIR––the European infrastructure for biological information––that aspires to a comprehensive and consistent registry of information about bioinformatics resources. The sustainable upkeep of this Tools and Data Services Registry is assured by a curation effort driven by and tailored to local needs, and shared amongst a network of engaged partners. As of November 2015, the registry includes 1785 resources, with depositions from 126 individual registrations including 52 institutional providers and 74 individuals. With community support, the registry can become a standard for dissemination of information about bioinformatics resources: we welcome everyone to join us in this common endeavour. The registry is freely available at https://bio.tools.publishedVersio

University of Bergen

FeatureMap3D—a tool to map protein features and sequence conservation onto homologous structures in the PDB

Author: Mølgaard Anne
Rapacki Kristoffer
Sackett Peter Wad
Stærfeldt Hans-Henrik
Wernersson Rasmus
Publication venue: Oxford University Press
Publication date: 01/01/2006
Field of study

FeatureMap3D is a web-based tool that maps protein features onto 3D structures. The user provides sequences annotated with any feature of interest, such as post-translational modifications, protease cleavage sites or exonic structure and FeatureMap3D will then search the Protein Data Bank (PDB) for structures of homologous proteins. The results are displayed both as an annotated sequence alignment, where the user-provided annotations as well as the sequence conservation between the query and the target sequence are displayed, and also as a publication-quality image of the 3D protein structure with the selected features and sequence conservation enhanced. The results are also returned in a readily parsable text format as well as a PyMol () script file, which allows the user to easily modify the protein structure image to suit a specific purpose. FeatureMap3D can also be used without sequence annotation, to evaluate the quality of the alignment of the input sequences to the most homologous structures in the PDB, through the sequence conservation colored 3D structure visualization tool. FeatureMap3D is available at:

PubMed Central

Copenhagen University Research Information System

Online Research Database In Technology

The Danish ELIXIR Node

Author: Brunak Søren
Løngreen Peter
Rapacki Kristoffer
Sperotto Maria Maddalena
Publication venue
Publication date: 01/01/2015
Field of study

Online Research Database In Technology

The EMBRACE web service collection

The EMBRACE (European Model for Bioinformatics Research and Community Education) web service collection is the culmination of a 5-year project that set out to investigate issues involved in developing and deploying web services for use in the life sciences. The project concluded that in order for web services to achieve widespread adoption, standards must be defined for the choice of web service technology, for semantically annotating both service function and the data exchanged, and a mechanism for discovering services must be provided. Building on this, the project developed: EDAM, an ontology for describing life science web services; BioXSD, a schema for exchanging data between services; and a centralized registry (http://www.embraceregistry.net) that collects together around 1000 services developed by the consortium partners. This article presents the current status of the collection and its associated recommendations and standards definition

RERO DOC Digital Library

Novel variation and <i>de novo </i>mutation rates in population-wide <i>de novo</i> assembled Danish trios

Author: Belling Kirstine C.
Besenbacher Søren
Bolund Lars
Bork-Jensen Jette
Brunak Søren
Børglum Anders D.
Cheng Xiaofang
Chmura Piotr Jaroslaw
Dam-Als Thomas
Demontis Ditte
Dworzynski Piotr
Eiberg Hans
Flores Santa Cruz David
Friborg Rune M.
Gonzalez-Izarzugaza Jose Maria
Grove Jakob
Gupta Ramneek
Hansen Torben
Huang Shujia
Jun Wang
Kristiansen Karsten
Krogh Anders
Lescai Francesco
Li Ning
Li Shengting
Liu Hao
Liu Siyang
Lund Ole
Mailund Thomas
Pedersen Christian N. S.
Pedersen Oluf
Rao Junhua
Rapacki Kristoffer
Rasmussen Simon
Rubio García Arcadio
Rydza Emil Karol
Schierup Mikkel H
Sun Jihua
Sørensen John Damm
Sørensen Thorkild I. A.
Villesen Palle
Wang Ou
Westergaard David
Xu Ruiqi
Xu Xun
Yadav Rachita
Yadav Rachita
Ye Weijian
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Building a population-specific catalogue of single nucleotide variants (SNVs), indels and structural variants (SVs) with frequencies, termed a national pan-genome, is critical for further advancing clinical and public health genetics in large cohorts. Here we report a Danish pan-genome obtained from sequencing 10 trios to high depth (50 × ). We report 536k novel SNVs and 283k novel short indels from mapping approaches and develop a population-wide de novo assembly approach to identify 132k novel indels larger than 10 nucleotides with low false discovery rates. We identify a higher proportion of indels and SVs than previous efforts showing the merits of high coverage and de novo assembly approaches. In addition, we use trio information to identify de novo mutations and use a probabilistic method to provide direct estimates of 1.27e−8 and 1.5e−9 per nucleotide per generation for SNVs and indels, respectively

Copenhagen University Research Information System

PubMed Central

Online Research Database In Technology

Tools and data services registry: a community effort to document bioinformatics resources

Author: Anthon Christian
Beard Niall
Berka Karel
Bolser Dan
Booth Tim
Bretaudeau Anthony
Brezovsky Jan
Brunak Søren
Casadio Rita
Cesareni Gianni
Chmura Piotr
Coppens Frederik
Cornell Michael
Cuccuru Gianmauro
Davidsen Kristian
de la Torre Victor
Dogan Tunca
Doppelt-Azeroual Olivia
Emery Laura
Friborg Rune Møllegaard
Gasteiger Elisabeth
Gatter Thomas
Goldberg Tatyana
Grosjean Marie
Grüning Björn
Helmer-Citterich Manuela
Ienasescu Hans
Ioannidis Vassilios
Ison Jon
Jespersen Martin Closter
Jimenez Rafael
Juty Nick
Juvan Peter
Kalaš Matúš
Koch Maximilian
Laibe Camille
Li Jing-Woei
Licata Luana
Løngreen Peter
Mareuil Fabien
Mičetić Ivan
Moretti Sebastien
Morris Chris
Ménager Hervé
Möller Steffen
Nenadic Aleksandra
Parkinson Helen
Peterson Hedi
Profiti Giuseppe
Rapacki Kristoffer
Rice Peter
Romano Paolo
Roncaglia Paola
Rost Burkhard
Rydza Emil
Saidi Rabie
Schafferhans Andrea
Schwämmle Veit
Smith Callum
Sperotto Maria Maddalena
Stockinger Heinz
Tosatto Silvio C.E.
Uva Paolo
Vařeková Radka Svobodová
Vedova Gianluca Della
Via Allegra
Vriend Gert
Yachdav Guy
Zambelli Federico
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2015
Field of study

Life sciences are yielding huge data sets that underpin scientific discoveries fundamental to improvement in human health, agriculture and the environment. In support of these discoveries, a plethora of databases and tools are deployed, in technically complex and diverse implementations, across a spectrum of scientific disciplines. The corpus of documentation of these resources is fragmented across the Web, with much redundancy, and has lacked a common standard of information. The outcome is that scientists must often struggle to find, understand, compare and use the best resources for the task at hand. Here we present a community-driven curation effort, supported by ELIXIR—the European infrastructure for biological information—that aspires to a comprehensive and consistent registry of information about bioinformatics resources. The sustainable upkeep of this Tools and Data Services Registry is assured by a curation effort driven by and tailored to local needs, and shared amongst a network of engaged partners. As of November 2015, the registry includes 1785 resources, with depositions from 126 individual registrations including 52 institutional providers and 74 individuals. With community support, the registry can become a standard for dissemination of information about bioinformatics resources: we welcome everyone to join us in this common endeavour. The registry is freely available at https://bio.tools

HAL Descartes

Online Research Database In Technology

Hal-Diderot

Archivio istituzionale della ricerca - Università di Padova

NERC Open Research Archive

Ghent University Academic Bibliography

Copenhagen University Research Information System

PubMed Central

Archivsystem Ask23

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

University of Southern Denmark Research Output

Archivio della ricerca- Università di Roma La Sapienza

HAL-Rennes 1

O-GLYCBASE Version 2.0 A revised database of O-glycosylated proteins

Author: Jan Hansen
Kristoffer Rapacki
Ole Lund
Publication venue
Publication date
Field of study

O-GLYCBASE is an updated database of information on glycoproteins and their O-linked glycosylation sites. Entries are compiled and revised from the literature, and from the SWISS-PROT database. Entries include information about species, sequence, glycosylation sites and glycan type. O-GLYCBASE is now fully cross-referenced to the SWISS-PROT, PIR, PROSITE, PDB, EMBL, HSSP, LISTA and MIM databases. Compared to version 1.0 the number of entries have increased by 34 %. Revisions of the O-glycan assignments was performed on 20 % of the entries. Sequence logos displaying the acceptor specificity patterns for the GalNAc, mannose and GlcNAc transferases are shown. The O-GLYCBASE database is available through WWW or by anonymous FTP. 3 INTRODUCTION Although it is not yet fully acknowledged, it is likely that most secreted proteins are glycosylated. One type is O-glycosylation, which is a post-translational event, where a carbohydrate is covalently linked to the hydroxyl group of serine or th..

CiteSeerX

O-GLYCBASE version 2.0: a revised database of O-glycosylated proteins

Author: Brunak Søren
Hansen Jan
Lund Ole
Rapacki Kristoffer
Publication venue
Publication date: 01/01/1997
Field of study

Online Research Database In Technology